Promoting Divergent Terms in the Estimation of Relevance Models

Authors

  • Javier Parapar
  • Alvaro Barreiro
Abstract

Pseudo-relevance feedback (PRF) techniques for query expansion have traditionally proved very effective. In particular, Relevance Models (RM) within the Language Modelling framework have become established as a high-performance baseline to beat. In this paper we present an alternative estimation of the RM that promotes terms which are present in the relevance set and are also distant from the language model of the collection. We compare this approach with RM3 and with an adaptation of Rocchio's KLD-based term ranking function to the Language Modelling framework. The evaluation shows that this alternative estimation of RM yields consistently better results than RM3 and is, on average, the most stable across collections in terms of robustness.
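The abstract gives no formulas, so the following is only a minimal sketch of the general idea of favouring terms that are frequent in the feedback set yet unlikely under the collection model. It uses the widely known KLD-style term score p(t|R) * log(p(t|R) / p(t|C)) with unsmoothed maximum-likelihood estimates. It is not the authors' estimation of the RM, and the function name `kld_term_scores` and the toy data are assumptions made for illustration.

```python
import math
from collections import Counter


def kld_term_scores(feedback_docs, collection_docs):
    """Score feedback terms by p(t|R) * log(p(t|R) / p(t|C)).

    Both arguments are lists of tokenised documents (lists of strings).
    Assumption: probabilities are plain relative frequencies, no smoothing.
    """
    fb_counts = Counter(t for doc in feedback_docs for t in doc)
    coll_counts = Counter(t for doc in collection_docs for t in doc)
    fb_total = sum(fb_counts.values())
    coll_total = sum(coll_counts.values())

    scores = {}
    for term, count in fb_counts.items():
        if coll_counts[term] == 0:
            # Term unseen in the collection: skipped here to avoid dividing
            # by zero (a real system would smooth instead).
            continue
        p_rel = count / fb_total                  # p(t|R)
        p_coll = coll_counts[term] / coll_total   # p(t|C)
        scores[term] = p_rel * math.log(p_rel / p_coll)
    return scores


# Hypothetical toy collection; the feedback set is one of its documents.
collection = [
    ["jaguar", "car", "speed"],
    ["jaguar", "cat", "jungle"],
    ["car", "engine", "speed"],
    ["the", "car", "the", "cat"],
]
feedback = [["jaguar", "cat", "jungle"]]

scores = kld_term_scores(feedback, collection)
ranked = sorted(scores, key=scores.get, reverse=True)
```

In this toy data "jungle" occurs only once in the whole collection, so it diverges most from the collection model and ranks above "jaguar" and "cat", which are equally frequent in the feedback set but more common overall.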


Similar resources

Evaluation of SARIMA time series models in monthly streamflow estimation in Idanak hydrometry station

Prediction of hydrological variables is a highly effective tool in water resource management. One of the important tools for modeling hydrological processes is the use of time series modeling and analysis. River series production series can be used by time series models in various studies such as drought, flood, reservoir systems design and many other purposes. For this purpose, monthly flow dat...


A New Method for Forecasting Uniaxial Compressive Strength of Weak Rocks

The uniaxial compressive strength of weak rocks (UCSWR) is among the essential parameters in the design of underground excavations, surface and underground mines, foundations in/on rock masses, and oil wells, serving as an input factor for some analytical and empirical methods such as RMR and RMI. The direct standard approaches are difficult, expensive, and time-consuming, especially with highl...


A comparison of algorithms for maximum likelihood estimation of Spatial GLM models

In spatial generalized linear mixed models, spatial correlation is assumed by adding normal latent variables to the model. In these models because of the non-Gaussian spatial response and the presence of latent variables the likelihood function cannot usually be given in a closed form, thus the maximum likelihood approach is very challenging. The main purpose of this paper is to introduce two n...


Investigation of the Allometric Models in Estimation of Poplar (Populus deltoides) Height

One of the most important issues in forest biometrics is the use of allometric functions to estimate the tree height by using diameter-height models. Measuring the total height of trees is usually a complex and time-consuming process. In allometric functions, the diameter is measured directly but the height of the tree is an estimate of an allometric model, which will be more accurate if the cr...


Estimation of Reference Evapotranspiration Using Artificial Neural Network Models and the Hybrid Wavelet Neural Network

Estimation of evapotranspiration is essential for planning, designing and managing irrigation and drainage schemes, as well as water resources management. In this research, artificial neural networks, neural network wavelet model, multivariate regression and Hargreaves' empirical method were used to estimate reference evapotranspiration in order to determine the best model in terms of efficienc...



Publication date: 2011